Bottom-Up Top-Down Cues for Weakly-Supervised Semantic Segmentation

机译：弱监督语义分割的自下而上自上而下的线索

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We consider the task of learning a classifier for semantic segmentation usingweak supervision in the form of image labels which specify the object classespresent in the image. Our method uses deep convolutional neural networks (CNNs)and adopts an Expectation-Maximization (EM) based approach. We focus on thefollowing three aspects of EM: (i) initialization; (ii) latent posteriorestimation (E-step) and (iii) the parameter update (M-step). We show thatsaliency and attention maps, our bottom-up and top-down cues respectively, ofsimple images provide very good cues to learn an initialization for theEM-based algorithm. Intuitively, we show that before trying to learn to segmentcomplex images, it is much easier and highly effective to first learn tosegment a set of simple images and then move towards the complex ones. Next, inorder to update the parameters, we propose minimizing the combination of thestandard softmax loss and the KL divergence between the true latent posteriorand the likelihood given by the CNN. We argue that this combination is morerobust to wrong predictions made by the expectation step of the EM method. Wesupport this argument with empirical and visual results. Extensive experimentsand discussions show that: (i) our method is very simple and intuitive; (ii)requires only image-level labels; and (iii) consistently outperforms otherweakly-supervised state-of-the-art methods with a very high margin on thePASCAL VOC 2012 dataset.

机译：我们考虑使用弱监督以图像标签的形式学习用于语义分割的分类器的任务，图像标签指定了图像中存在的对象类别。我们的方法使用深度卷积神经网络（CNN），并采用基于期望最大化（EM）的方法。我们集中在以下三个方面：（i）初始化；（ii）潜在后验重估（E步）和（iii）参数更新（M步）。我们显示了简单图像的显着性图和注意力图，分别是我们的自下而上和自上而下的线索，为学习基于EM的算法的初始化提供了很好的线索。直观地表明，在尝试学习对复杂图像进行分割之前，先学习对一组简单图像进行细分然后转向复杂图像是更加容易和高效的。接下来，为了更新参数，我们建议最小化标准softmax损失和真实潜在后验概率（CNN）给出的KL散度的组合。我们认为，这种组合对于由EM方法的预期步骤做出的错误预测更为可靠。我们以经验和视觉结果支持这一论点。大量的实验和讨论表明：（i）我们的方法非常简单直观。（ii）仅需要图像级标签；（iii）在PASCAL VOC 2012数据集上始终以非常高的优势持续优于其他弱监督的最新方法。

著录项

作者
Hou, Qinbin; Dokania, Puneet Kumar; Massiceti, Daniela; Wei, Yunchao; Cheng, Ming-Ming; Torr, Philip;
展开▼
作者单位

展开▼
年度 2017
总页数
原文格式 PDF
正文语种
中图分类

相似文献

外文文献
中文文献
专利

1. Robust aircraft segmentation from very high-resolution images based on bottom-up and top-down cue integration [J] . Gao Feng, Xu Qizhi, Li Bo Journal of Applied Remote Sensing . 2016,第1期

机译：基于自下而上和自上而下的提示集成，从超高分辨率图像中进行可靠的飞机分割
2. OBJCUT: Efficient Segmentation Using Top-Down and Bottom-Up Cues [J] . Kumar M. Pawan, Torr Philip H.S., Zisserman Andrew Pattern Analysis and Machine Intelligence, IEEE Transactions on . 2010,第3期

机译：OBJCUT：使用自上而下和自下而上的提示进行有效的细分
3. Integration of top-down and bottom-up visual processing using a recurrent convolutional-deconvolutional neural network for semantic segmentation [J] . Intelligent Service Robotics . 2020,第1期

机译：使用反复卷积 - 解卷积神经网络进行自上而下和自下而上的视觉处理的集成，用于语义分割
4. Bottom-Up Top-Down Cues for Weakly-Supervised Semantic Segmentation [C] . Qibin Hou, Daniela Massiceti, Puneet Kumar Dokania, International conference on energy minimization methods in computer vision and pattern recognition . 2018

机译：自底向上的提示，用于弱监督的语义分割
5. Weakly-Supervised Semantic Segmentation in the Multi-Class Setting across Different Image Domains [D] . Chan, Lyndon Hin-Cheung. 2020

机译：不同图像域的多级环境中的弱监督语义细分
6. Independent effects of bottom-up temporal expectancy and top-down spatial attention. An audiovisual study using rhythmic cueing [O] . Alexander Jones 2014

机译：自下而上的时间预期和自上而下的空间注意的独立影响。有节奏提示的视听研究
7. Weakly-Supervised Semantic Segmentation using Motion Cues [O] . Tokmakov, Pavel, Alahari, Karteek, Schmid, Cordelia 2017

机译：使用运动线索进行弱监督的语义分割
8. Unsupervised Tattoo Segmentation Combining Bottom-Up and Top-Down Cues. [R] . Allen, J. D., Zhao, N., Yuan, J., 2013

机译：结合自下而上和自上而下的线索的无监督纹身分割。

Bottom-Up Top-Down Cues for Weakly-Supervised Semantic Segmentation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅